alexott's Repositories
100 repositories
ace
Ace (Ajax.org Cloud9 Editor)
⭐ 0
Public
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
⭐ 0
Public
airflow-site
Apache Airflow Website
⭐ 0
Public
alexott.github.com
My site, http://alexott.net/
⭐ 4
Public
alia
High-performance Cassandra client for Clojure
⭐ 0
Public
anomaly_detection_using_databricks
No description
⭐ 6
Public
Awesome-SOAR
A curated Cyber "Security Orchestration, Automation and Response (SOAR)" awesome list.
⭐ 1
Public
azure-cosmos-db-cassandra-api-spark-connector-sample
Sample that provides guidelines and best practices for using the DataStax Spark Cassandra Connector against the Cosmos DB Cassandra API.
⭐ 1
Public
azure-cosmosdb-spark
Apache Spark Connector for Azure Cosmos DB
⭐ 1
Public
azure-event-hubs-spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
⭐ 1
Public
beats
🐠 Beats - Lightweight shippers for Elasticsearch & Logstash
⭐ 1
Public
beats-zerobus
Databricks Zerobus support in Elastic beats (filebeat, ...)
⭐ 0
Public
boost-asio-examples
Source code for examples from article "What is Boost.Asio, and why we should use it", http://alexott.net/en/cpp/BoostAsioNotes.html
⭐ 85
Public
📦 Archived
boost-asio-proxy
Source code for examples from article "How to write simple HTTP proxy with Boost.Asio", http://alexott.net/en/cpp/BoostAsioProxy.html
⭐ 81
Public
cassaforte
Modern, high-level Clojure driver (client) for Cassandra built around CQL 3
⭐ 0
Public
cassandra
Mirror of Apache Cassandra
⭐ 0
Public
cassandra-dse-playground
Code samples for different components of DSE (DataStax Enterprise) & related technologies
⭐ 5
Public
cedet
My mirror of the CEDET bzr repository (http://cedet.sf.net). Mostly used for experimental work that will be merged into the bzr version later
⭐ 15
Public
chaos
No description
⭐ 0
Public
chispa
PySpark test helper methods with beautiful error messages
⭐ 0
Public
clj-gsb
Clojure interface to Google's Safe Browsing API
⭐ 1
Public
clj-serializer
Fast binary serialization and deserialization for Clojure data structures
⭐ 2
Public
clj-tika
Clojure bindings to Apache Tika project
⭐ 24
Public
clojure
The Clojure programming language
⭐ 1
Public
clojure-course-ru-concurrency
Transcript & slides of lectures on concurrency in Clojure (for https://clojurecourse.by/)
⭐ 3
Public
clojure-examples
Different examples in Clojure - for articles, blog postings, etc.
⭐ 7
Public
clojure-hadoop
Library to aid writing Hadoop jobs in Clojure.
⭐ 98
Public
clojure-hbase-schemas
Schema-based HBase Interaction
⭐ 2
Public
clojure-libs
Different libraries for Clojure
⭐ 6
Public
clojure-opennlp
Natural Language Processing in Clojure (opennlp)
⭐ 1
Public
clojure-semantic
Experiments with Emacs semantic.el and Clojure
⭐ 0
Public
courses
fast.ai Courses
⭐ 0
Public
cpp-tesing-examples
Examples for article on Unit testing with C++
⭐ 19
Public
cql-mode
Emacs mode for working with CQL (Cassandra Query Language)
⭐ 2
Public
cyber-spark-data-connectors
Cybersecurity-related custom data connectors for Spark
⭐ 2
Public
dabs-playground
Different examples around Databricks Asset Bundles (DABs)
⭐ 3
Public
dasl-content-packs
No description
⭐ 1
Public
databricks-api
A simplified, autogenerated API client interface using the databricks-cli package
⭐ 1
Public
databricks-cicd-definitelynotademo
No description
⭐ 1
Public
databricks-cybersecurity-playground
Different pieces of code related to doing cybersecurity on Databricks
⭐ 4
Public
databricks-dbt-playground
Playing with DBT on Databricks
⭐ 4
Public
databricks-nutter-repos-demo
Demo of using Nutter to test Databricks notebooks in a CI/CD pipeline
⭐ 152
Public
databricks-playground
Code samples, etc. for Databricks
⭐ 73
Public
databricks-repos-proxy
No description
⭐ 0
Public
databricks-sdk-go
Databricks SDK for Go
⭐ 0
Public
databricks-sdk-java
Databricks SDK for Java
⭐ 0
Public
databricks-sdk-py
Databricks SDK for Python
⭐ 1
Public
databricks-sql-connector-unofficial
Unofficial sources for Databricks SQL connector (until it's officially published)
⭐ 1
Public
📦 Archived
databricks-sql-python
Databricks SQL Connector for Python
⭐ 1
Public
datalake-ADLS-access-patterns-with-Databricks
No description
⭐ 1
Public
datastax-bootcamp-project
Source code for project from DataStax's bootcamp
⭐ 4
Public
📦 Archived
db-demo-project
demo project
⭐ 0
Public
dbx
CLI tool for advanced Databricks jobs management.
⭐ 1
Public
dbx-stable-url
A small Terraform Repository to create Stable URL infrastructure in AWS and Azure.
⭐ 0
Public
db_dlt_workshop
Databricks Delta Live Tables Workshop
⭐ 0
Public
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
⭐ 1
Public
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
⭐ 2
Public
Delta-Live-Tables-Hands-on-Workshop
Delta Live Tables Workshop Resources
⭐ 1
Public
delta-live-tables-notebooks
No description
⭐ 0
Public
delta-live-tables-playground
Examples of Databricks Delta Live Tables
⭐ 0
Public
delta-sharing
An open protocol for secure data sharing
⭐ 1
Public
dlt-files-in-repos-demo
Demonstration of using Files in Repos with Databricks Delta Live Tables
⭐ 35
Public
dnks-terraform-lab
Curated collection of reusable Terraform snippets, samples, blueprints, examples, etc.
⭐ 0
Public
dns-analytics
No description
⭐ 0
Public
dotemacs
My personal Emacs configuration
⭐ 3
Public
dsbulk
DataStax Bulk Loader (DSBulk) is an open-source, Apache-licensed, unified tool for loading into and unloading from Apache Cassandra(R), DataStax Astra and DataStax Enterprise (DSE)
⭐ 0
Public
dse-java-playground
Playing with different pieces of DSE Java driver
⭐ 2
Public
📦 Archived
dse-search-tools
Utility classes for working with DSE Search
⭐ 0
Public
📦 Archived
ecb
Moved to https://github.com/ecb-home/ecb
⭐ 99
Public
emacs-addons
Repository of my packages (either written by me, or hacked by me)
⭐ 2
Public
emacs-configs
My personal Emacs configuration
⭐ 255
Public
emacs-guide-ru
No description
⭐ 14
Public
empythy
Automated NLP sentiment predictions: batteries included, or use your own data
⭐ 0
Public
fastbook
Draft of the fastai book
⭐ 0
Public
galimatias
galimatias is a URL parsing and normalization library written in Java.
⭐ 0
Public
gatling-dse-examples
Examples of using gatling-dse-plugin & gatling-dse-stress
⭐ 0
Public
geomesa
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
⭐ 0
Public
graph
Practical Gremlin - An Apache TinkerPop Tutorial
⭐ 0
Public
graphframes
No description
⭐ 1
Public
hbc
A Java HTTP client for consuming Twitter's Streaming API
⭐ 0
Public
incubator-sedona
A cluster computing framework for processing large-scale geospatial data
⭐ 0
Public
infer
Inference and machine learning in Clojure
⭐ 16
Public
java-driver
DataStax Java Driver for Apache Cassandra
⭐ 0
Public
JFastText
Java interface for fastText
⭐ 0
Public
kafka-connect-twitter
Kafka Connect Source for Twitter
⭐ 1
Public
kafka-connect-twitter-1
Kafka Connect connector to stream data in real time from Twitter.
⭐ 0
Public
kafka-streams-experiments
Experiments with Kafka Streams
⭐ 0
Public
kafka-streams-playground
A few examples for Kafka Streams
⭐ 0
Public
ksql-exps
Experiments with KSQL
⭐ 0
Public
lein-hadoop
Leiningen plugin for generating Hadoop-compatible jars
⭐ 2
Public
lein-simple-project
Example of a project that uses Leiningen
⭐ 3
Public
merchant-classification
This series of notebooks shows how the Lakehouse for Financial Services enables banks, open banking aggregators and payment processors to address the challenge of merchant classification
⭐ 0
Public
migrate
No description
⭐ 0
Public
mlflow
Open source platform for the machine learning lifecycle
⭐ 0
Public
mlflow-webhook-azure-devops
No description
⭐ 4
Public
muse
Emacs MUSE
⭐ 56
Public
neo4j-spark-connector
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
⭐ 1
Public
NETL-Automatic-Topic-Labelling-
Generating labels for topics automatically using neural embeddings
⭐ 1
Public
nlp_model_selection_app
No description
⭐ 0
Public
nlu
1 line for hundreds of NLP models and algorithms
⭐ 1
Public